Automatic Verb Classification Using Distributions of Grammatical Features
Abstract
We apply machine learning techniques to classify automatically a set of verbs into lexical semantic classes, based on distributional approximations of diatheses, extracted from a very large annotated corpus. Distributions of four grammatical features are sufficient to reduce error rate by 50% over chance. We conclude that corpus data is a usable repository of verb class information, and that corpus-driven extraction of grammatical features is a promising methodology for automatic lexical acquisition.
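As a rough illustration of what "reduce error rate by 50% over chance" can mean numerically, the sketch below computes the remaining error under a chance baseline. The abstract does not state the number of classes or whether they are balanced, so the three equally sized classes used here are an assumption for illustration only, and the function name is hypothetical.

```python
def error_after_reduction(num_classes: int, relative_reduction: float) -> float:
    """Error rate left after cutting the chance error rate by a given fraction.

    Assumes balanced classes, so a random guess is right 1/num_classes of the time.
    """
    chance_error = 1.0 - 1.0 / num_classes
    return chance_error * (1.0 - relative_reduction)


# Assumption (not stated in the abstract): three balanced verb classes.
error = error_after_reduction(num_classes=3, relative_reduction=0.5)
accuracy = 1.0 - error
print(f"error={error:.3f}, accuracy={accuracy:.3f}")
```

Under these assumed conditions, chance error is about 66.7%, so halving it leaves roughly 33.3% error. With a different number of classes or an unbalanced class distribution the figures would change.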
Similar resources
Automatic Verb Classification Using Multilingual Resources
We propose the use of multilingual corpora in the automatic classification of verbs. We extend the work of Merlo and Stevenson (2001), in which statistics over simple syntactic features extracted from textual corpora were used to train an automatic classifier for three lexical semantic classes of English verbs. We hypothesize that some lexical semantic features that are difficult to detect superfic...
Automatic Lexical Acquisition Based on Statistical Distributions
We automatically classify verbs into lexical semantic classes, based on distributions of indicators of verb alternations, extracted from a very large annotated corpus. We address a problem which is particularly difficult because the verb classes, although semantically different, show similar surface syntactic behavior. Five grammatical features are sufficient to reduce error rate by more than 50% ov...
A Multilingual Paradigm for Automatic Verb Classification
We demonstrate the benefits of a multilingual approach to automatic lexical semantic verb classification based on statistical analysis of corpora in multiple languages. Our research incorporates two interrelated threads. In one, we exploit the similarities in the crosslinguistic classification of verbs, to extend work on English verb classification to a new language (Italian), and to new classes w...
Publication date: 1999